Prokaryote clustering based on DNA curvature distributions
نویسندگان
چکیده
Massive determination of complete genome sequences has led to the development of different tools for genome comparisons. Our approach is to compare genomes according to typical genomic distributions of a mathematical function that reflects a certain biological function. In this study we used comprehensive genome analysis of DNA curvature distributions in coding and non-coding regions of prokaryotic genomes to evaluate the assistance of mathematical and statistical procedures. Due to an extensive amount of data we were able to define the factors influencing the curvature distribution in promoter and terminator regions such as growth temperature, genome size, and A+ T composition. Two clusteringmethods,K -means and PAM,were applied and produced very similar clusterings that reflect genomic attributes and environmental conditions of the species’ habitat. © 2008 Elsevier B.V. All rights reserved.
منابع مشابه
Ecologic genomics of DNA: upstream bending in prokaryotic promoters.
After our analysis of the distribution of predicted intrinsic curvature along all available complete prokaryotic genomes, the genomes were divided into two groups. Curvature distribution in all prokaryotes of the first group indicated a substantial fraction of promoters characterized by intrinsic DNA curvature located within or upstream of the promoter region. We did not find this peculiar DNA ...
متن کاملInvolvement of DNA curvature in intergenic regions of prokaryotes
It is known that DNA curvature plays a certain role in gene regulation. The distribution of curved DNA in promoter regions is evolutionarily preserved, and it is mainly determined by temperature of habitat. However, very little is known on the distribution of DNA curvature in termination sites. Our main objective was to comprehensively analyze distribution of curved sequences upstream and downs...
متن کاملVisual Clustering and Exploration of Splicing Sites using DNA Curvature Criteria
The aim of this paper is to explore the clustering capabilities of our visual genome explorer software. Genome3DExplorer is a new modeling and software solution to explore textual and factual genomic data. It offers a powerful and user-centered visualization of this information within an immersive environment. The visualization is based on a graphical paradigm that automatically helps to build ...
متن کاملPHOSPHOLIPID ANALOGUE DISTRIBUTIONS OF IRANIAN ISOLATES OF CANDIDA
The aim of this study was to analyse polar lipids of Candida species isolated from Ahwaz (Iran) by Fast Atom Bombardment Mass Spectrometry (FAB MS). Nine isolates of Candida Sp. were identified by growth at 45°C, production of chlamydoconidia on cornmeal agar, colonial colour on CHROMagar Candida, germ tube production and ID 32C kits. Then polar lipids were extracted from freeze-dried cult...
متن کاملLinguistic Categories as Basins of Curvature
Linguistic grammars do a good job of elucidating the lexical categories and phrasal units of the languages they model. Certain recurrent Connectionist networks can model similar data but it is not easy to discern the abstract structures of the resulting representations. Hierarchical clustering is helpful in this regard, but it often produces implausible clusters and there seems to be no princip...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Discrete Applied Mathematics
دوره 157 شماره
صفحات -
تاریخ انتشار 2009